PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID NNU_006551-RA
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; stem eudicotyledons; Proteales; Nelumbonaceae; Nelumbo
Family B3
Protein Properties Length: 937aa    MW: 106958 Da    PI: 10.766
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
NNU_006551-RAgenomeCASView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B356.64.9e-18341131397
                    TT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE CS
             B3  13 sgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvf 97 
                     + l +p kf+ +h  kke s  ++l+ +sg++W+v++ +   +gr++++ GW+ Fv++n Lke+ ++vF+++g+s f   v +f
  NNU_006551-RA  34 DQKLNIPPKFM-HHV-KKEISSSVSLRGPSGKTWKVEIVM-DEDGRCFFQHGWNTFVQENSLKEKNVLVFRYTGNSIF--DVLLF 113
                    33489******.553.4568889***************65.5557899*************************99888..66665 PP

2B350.53.6e-16267355193
                    EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SS.SEE..E CS
             B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgr.sefelv 93 
                    f+ v+ p +v k+  l++p   ++ h ++  +s +++  + +g++W+v ++ r  +++ +l  GW +Fv  n+L+egD++vF++ +r ++f l+
  NNU_006551-RA 267 FKIVMQPTHVYKRFYLTIPAAAVRMHFLP--RSEDVI-LSVDGKTWRVMFRSRSPGQGAFLN-GWPKFVLHNNLEEGDVCVFEVGKRrNNF-LH 355
                    67789999******************998..454555.778*********766666667777.********************87553444.33 PP

3B355.88.3e-184054821699
                    EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
             B3  16 lvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99 
                      +p++f+++++g+   + t +l++++ ++W+v++  +k +++ +l+kGW++Fvk + L +gD ++F++ g  +f   vk++++
  NNU_006551-RA 405 CKIPRPFIKHFNGS--VPATFILRSPTKKRWRVRV--KKVKEQWYLQKGWQSFVKFHSLVVGDLLIFSYRGGAKF--SVKIYDR 482
                    47******877555..3568***************..9999999************************9775555..9999987 PP

4B3665.5e-21570660199
                    EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEE CS
             B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkv 96 
                    ffkvl+ +d +k+  l +p  f+++++g+   +k ltl++++g+sW vk+  +k ++++++ +GW++Fv+ + L  gDf+vF+++g+s+f   vk+
  NNU_006551-RA 570 FFKVLVAPDFSKL--LGIPPLFIKHFNGSV--PKRLTLRSPTGKSWPVKV--KKIDEKFYFHTGWQRFVEHHSLVWGDFLVFSYNGKSKF--SVKI 657
                    99***99999987..9*******8875553..456***************..********************************998999..9999 PP

                    E-S CS
             B3  97 frk 99 
                    +++
  NNU_006551-RA 658 YDR 660
                    987 PP

5B340.64.5e-138319061390
                    TT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SS.SEE CS
             B3  13 sgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgr.sef 90 
                    ++ +v+pk  a++ g+   ++ + +l d++grsW+v +   +++gr+ + +GW  F kan++ eg+ + F++++    +
  NNU_006551-RA 831 RSHMVVPKAVARKVGIT--GKGKAVLLDPKGRSWRVSF-APRTDGRVDIISGWAAFWKANNIVEGEACHFEFIQGtVAG 906
                    5669*******999866..4448***************.57777889999*********************99654555 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1019361.75E-2121114IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.101.5E-2222114IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086314.35322116IPR003340B3 DNA binding domain
CDDcd100177.63E-1623114No hitNo description
SMARTSM010193.5E-1225116IPR003340B3 DNA binding domain
PfamPF023624.4E-1535109IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.101.3E-18264362IPR015300DNA-binding pseudobarrel domain
CDDcd100171.70E-19266362No hitNo description
SuperFamilySSF1019368.63E-19266356IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086312.844267362IPR003340B3 DNA binding domain
SMARTSM010191.6E-15267364IPR003340B3 DNA binding domain
PfamPF023624.2E-14267353IPR003340B3 DNA binding domain
SMARTSM010191.6E-10396483IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.101.6E-19403485IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.51E-19405490IPR015300DNA-binding pseudobarrel domain
PfamPF023624.5E-15405482IPR003340B3 DNA binding domain
PROSITE profilePS5086313.394407483IPR003340B3 DNA binding domain
CDDcd100176.95E-14407481No hitNo description
Gene3DG3DSA:2.40.330.101.1E-24564663IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019364.9E-25566668IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086314.128568661IPR003340B3 DNA binding domain
CDDcd100173.92E-19568659No hitNo description
SMARTSM010195.5E-18570661IPR003340B3 DNA binding domain
PfamPF023626.6E-19570660IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.105.4E-17813917IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019369.81E-17815916IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086312.421819918IPR003340B3 DNA binding domain
SMARTSM010191.8E-7822914IPR003340B3 DNA binding domain
CDDcd100176.29E-10829911No hitNo description
PfamPF023621.4E-11830908IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005773Cellular Componentvacuole
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 937 aa     Download sequence    Send to blast
MGGRCDECRS SDEYVYWTHF QSRQFFQILT NDFDQKLNIP PKFMHHVKKE ISSSVSLRGP  60
SGKTWKVEIV MDEDGRCFFQ HGWNTFVQEN SLKEKNVLVF RYTGNSIFDV LLFDQFSLCE  120
KVSSYFPSKC GCNLKESSTE SSVEFIYSSV VDVDERKRGR GGRGIKCSPS FRSKGKEAER  180
SEPFSWRNTS QAKRKKIEKK SATPLPAKRK PKREEPADYS GDEELSESLN RAFHIHFLSK  240
RRPVTEVEML RTLELANEAL VLSETSFKIV MQPTHVYKRF YLTIPAAAVR MHFLPRSEDV  300
ILSVDGKTWR VMFRSRSPGQ GAFLNGWPKF VLHNNLEEGD VCVFEVGKRR NNFLHIDVKI  360
FRVVEEVVPL QFPLSTCLYI MSPKAMFVGW CPPAKNVKPL PLRKCKIPRP FIKHFNGSVP  420
ATFILRSPTK KRWRVRVKKV KEQWYLQKGW QSFVKFHSLV VGDLLIFSYR GGAKFSVKIY  480
DRSACEKELP FSSSAYTQRN PVDSSHPHPK KGRRGERKAG GLQKPVWERP SVQDSKSIFR  540
PKEIRWVKVA RPYTVEMGRM IPYRRSRPSF FKVLVAPDFS KLLGIPPLFI KHFNGSVPKR  600
LTLRSPTGKS WPVKVKKIDE KFYFHTGWQR FVEHHSLVWG DFLVFSYNGK SKFSVKIYDR  660
SSCERDLPSA PSNVHSQPRK QKRGKEMARV TTEVVVQNHG MSASHPNKEK PGTLMARART  720
KVIAQIYESA SDSHPRKEKK GDVIARSTKK GIDRALRKSY SHPKMDKCGK AMAKSTMKAV  780
VSSKGNGGSG KPRKKSNYGI ERQPRSEAIE AASSYKTEFP HFAALCGKSR RSHMVVPKAV  840
ARKVGITGKG KAVLLDPKGR SWRVSFAPRT DGRVDIISGW AAFWKANNIV EGEACHFEFI  900
QGTVAGKVII CVNIFRAAGF GELARFSNII PMAFFGL
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1677684PRKQKRGK
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00468DAPTransfer from AT4G33280Download
Motif logo
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010255061.10.0PREDICTED: B3 domain-containing protein Os01g0723500-like
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G33280.13e-49B3 family protein